11-4 PCA for Face Recognition

This section explains the use of PCA for face recognition.

First of all, you need to read the face dataset using the following script:

Example 1: faceRecog/faceDataRead01.m

imageDir=[mltRoot, '/dataSet/att_faces'];
faceData=recursiveFileList(imageDir, 'pgm');
fprintf('Reading %d face images from %s...', length(faceData), imageDir);
tic
for i=1:length(faceData)
    % fprintf('%d/%d: file=%s\n', i, length(faceData), faceData(i).path);
    faceData(i).image=imread(faceData(i).path);
end
fprintf(' ===> %.2f sec\n', toc);
fprintf('Saving faceData.mat...\n');
save faceData faceData

Output:
Reading 400 face images from D:\users\jang\matlab\toolbox\machineLearning/dataSet/att_faces... ===> 0.98 sec
Saving faceData.mat...

The face data is then saved as a structure array faceData of size 400 in the file faceData.mat.
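
If you want a quick sanity check of what was saved, you can load the file and inspect the structure array. This is only an illustrative sketch; the field names below (path, parentDir, image) are the ones used in the later examples, and recursiveFileList may return additional fields as well.

% Quick sanity check of the saved structure array (fields as used in later examples)
load faceData.mat
fprintf('Number of face images = %d\n', length(faceData));
fprintf('Image size = %d x %d\n', size(faceData(1).image));
disp(faceData(1).path);         % full path to the image file
disp(faceData(1).parentDir);    % folder name, used later as the class (person) label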

You can display one of the face images as follows:

Example 2: faceRecog/faceDisplay01.m

load faceData.mat
subplot(1,2,1);
imagesc(faceData(1).image); colormap(gray)
axis image;
subplot(1,2,2);
surf(double(faceData(1).image)); colormap(gray)
axis image; shading interp;
view(140, 80);

If you want to display the face images of the first 4 persons, you can use "montage" to do so:

Example 3: faceRecog/faceDisplay02.m

load faceData.mat
filePaths={faceData.path};
montage(filePaths(1:40), 'Size', [nan, 10]);

Alternatively, you can display all the images as a single big image:

Example 4: faceRecog/faceDisplay03.m

load faceData.mat
filePaths={faceData.path};
montage(filePaths, 'Size', [nan, 30]);

Output:
[Warning: Image is too big to fit on screen; displaying at 33%]
[> In imuitools\private\initSize at 72
   In imshow at 283
   In montage at 137
   In faceDisplay03 at 3
   In goWriteOutputFile>dummyFunction at 83
   In goWriteOutputFile at 53]

To try PCA on these face images, we need to find the mean face first:

Example 5: faceRecog/meanFaceDisplay01.m

load faceData.mat
allFaces=double(cat(3, faceData.image));
meanFace=mean(allFaces, 3);
imagesc(meanFace);
axis image; colormap(gray);

Now we are ready to put all the face images (after mean subtraction) into a big matrix A and find the eigenvalues and eigenvectors of A*A' for PCA. In particular, we can plot the percentage of the total variance versus the number of eigenvalues to get an idea of how PCA can "squeeze" the variance into the first few eigenvalues, as follows.
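
As a side note on the math: A is a d-by-400 matrix, where d is the number of pixels per image and is typically much larger than 400, so A*A' is a very large matrix. A common implementation trick (whether the toolbox's pca function uses it internally is not shown here) is to take the eigenvectors v_i of the much smaller matrix A'*A and map them back through A:

\[
A^{\mathsf T}A\,v_i = \lambda_i v_i
\;\;\Longrightarrow\;\;
(AA^{\mathsf T})(Av_i) = \lambda_i\,(Av_i),
\qquad
u_i = \frac{Av_i}{\lVert Av_i\rVert}.
\]

The curve plotted in the next example is the cumulated variance percentage, matching cumVar/cumVar(end)*100 in the code:

\[
p(k) = 100\cdot\frac{\sum_{i=1}^{k}\lambda_i}{\sum_{i=1}^{n}\lambda_i}.
\]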

Example 6: faceRecog/varVsPcaEigNum01.m

load faceData.mat
[rowDim, colDim]=size(faceData(1).image);
% ====== Compute mean face
meanFace=mean(double(cat(3, faceData.image)), 3);
% ====== Put all face into a big matrix
fprintf('Put all images into a big matrix... '); tic
for i=1:length(faceData)
    A(:,i)=double(faceData(i).image(:))-meanFace(:);
end
fprintf(' ===> %.2f sec\n', toc);
% ====== Perform PCA
fprintf('Perform PCA... '); tic
[A2, eigVec, eigValue]=pca(A);
fprintf(' ===> %.2f sec\n', toc);
% ====== Plot variance percentage vs. no. of eigenvalues
cumVar=cumsum(eigValue);
cumVarPercent=cumVar/cumVar(end)*100;
plot(cumVarPercent, '.-');
xlabel('No. of eigenvalues');
ylabel('Cumulated variance percentage (%)');
title('Variance percentage vs. no. of eigenvalues');
fprintf('Saving results into eigenFaceResult.mat...\n');
save eigenFaceResult A2 eigVec cumVarPercent rowDim colDim

Output:
Put all images into a big matrix... ===> 0.04 sec
Perform PCA... ===> 0.31 sec
Saving results into eigenFaceResult.mat...

Once we have the eigenvectors of A*A', we can display the first few eigenfaces:

Example 7: faceRecog/eigenFaceDisplay01.m

load eigenFaceResult.mat % load A2, eigVec, rowDim, colDim, etc
reducedDim=16; % Display the first 16 eigenfaces
eigenfaces = reshape(eigVec, rowDim, colDim, size(A2,2));
side=ceil(sqrt(reducedDim));
for i=1:reducedDim
    subplot(side,side,i);
    imagesc(eigenfaces(:,:,i));
    axis image; colormap(gray);
    set(gca, 'xticklabel', '');
    set(gca, 'yticklabel', '');
end

For the purpose of visualization, we can project the original faces onto the 2D face space spanned by the first two eigenfaces:
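
In other words, each face is represented by its coordinates along the first two eigenfaces. Assuming, as the later examples suggest, that the matrix A2 returned by pca holds the projection coefficients, A2(1:2,:) contains exactly these coordinates:

\[
y = V_2^{\mathsf T}(x - m), \qquad V_2 = [\,u_1\;\; u_2\,],
\]

where m is the mean face, u_1 and u_2 are the first two eigenfaces, and y is the 2D feature vector used as DS.input in the next example.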

Example 8: faceRecog/face2dPcaProj01.m

load faceData.mat
load eigenFaceResult.mat % Load A2, eigVec, rowDim, colDim, etc
DS.input=A2(1:2,:);
DS.outputName=unique({faceData.parentDir});
DS.output=zeros(1, size(DS.input,2));
for i=1:length(DS.output)
    DS.output(i)=find(strcmp(DS.outputName, faceData(i).parentDir));
    DS.annotation{i}=faceData(i).path;
end
dsScatterPlot(DS);
[recogRate, computed, nearestIndex]=knncLoo(DS);
fprintf('Recog. rate = %.2f%%\n', 100*recogRate);

Output:
Recog. rate = 39.00%

The leave-one-out recognition rate of KNNC over the projected dataset is only 39.00%. This is on the low side, since the variance accounted for by the first 2 eigenvalues is only about 30.52%.
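
The 30.52% figure can be read directly from the cumulated variance percentage saved in Example 6, for instance:

% cumVarPercent was saved into eigenFaceResult.mat by Example 6
load eigenFaceResult.mat
fprintf('Variance covered by the first 2 eigenvalues = %.2f%%\n', cumVarPercent(2));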

To find the best number of eigenvectors to retain, we can perform an exhaustive search:

Example 9: faceRecog/optPcaEigNum01.m

load faceData.mat
load eigenFaceResult.mat % Load A2, eigVec, cumVarPercent, rowDim, colDim
% ====== Create DS
DS.input=A2;
DS.outputName=unique({faceData.parentDir});
DS.output=zeros(1, size(DS.input,2));
for i=1:length(DS.output)
    DS.output(i)=find(strcmp(DS.outputName, faceData(i).parentDir));
    DS.annotation{i}=faceData(i).path;
end
% ====== RR w.r.t. no. of eigenvectors
myTic=tic;
maxDim=100;
rr=pcaPerfViaKnncLoo(DS, maxDim, 1);
plot(1:maxDim, cumVarPercent(1:maxDim), '.-', 1:maxDim, rr*100, '.-'); grid on
xlabel('No. of eigenfaces');
ylabel('LOO recog. rate & cumulated variance percentage');
[maxValue, maxIndex]=max(rr);
line(maxIndex, maxValue*100, 'marker', 'o', 'color', 'r');
legend('Cumulated variance percentage', 'LOO recog. rate', 'location', 'southeast');
fprintf('Optimum number of eigenvectors = %d, with recog. rate = %.2f%%\n', maxIndex, maxValue*100);
toc(myTic)

Output:
LOO recog. rate of KNNC using 1 dim = 36/400 = 9%
LOO recog. rate of KNNC using 2 dim = 152/400 = 38%
LOO recog. rate of KNNC using 3 dim = 291/400 = 72.75%
LOO recog. rate of KNNC using 4 dim = 326/400 = 81.5%
LOO recog. rate of KNNC using 5 dim = 342/400 = 85.5%
LOO recog. rate of KNNC using 6 dim = 361/400 = 90.25%
LOO recog. rate of KNNC using 7 dim = 378/400 = 94.5%
LOO recog. rate of KNNC using 8 dim = 381/400 = 95.25%
LOO recog. rate of KNNC using 9 dim = 385/400 = 96.25%
LOO recog. rate of KNNC using 10 dim = 385/400 = 96.25%
LOO recog. rate of KNNC using 11 dim = 390/400 = 97.5%
LOO recog. rate of KNNC using 12 dim = 391/400 = 97.75%
LOO recog. rate of KNNC using 13 dim = 391/400 = 97.75%
LOO recog. rate of KNNC using 14 dim = 390/400 = 97.5%
LOO recog. rate of KNNC using 15 dim = 390/400 = 97.5%
LOO recog. rate of KNNC using 16 dim = 389/400 = 97.25%
LOO recog. rate of KNNC using 17 dim = 390/400 = 97.5%
LOO recog. rate of KNNC using 18 dim = 390/400 = 97.5%
LOO recog. rate of KNNC using 19 dim = 392/400 = 98%
LOO recog. rate of KNNC using 20 dim = 390/400 = 97.5%
LOO recog. rate of KNNC using 21 dim = 391/400 = 97.75%
LOO recog. rate of KNNC using 22 dim = 391/400 = 97.75%
LOO recog. rate of KNNC using 23 dim = 391/400 = 97.75%
LOO recog. rate of KNNC using 24 dim = 393/400 = 98.25%
LOO recog. rate of KNNC using 25 dim = 393/400 = 98.25%
LOO recog. rate of KNNC using 26 dim = 393/400 = 98.25%
LOO recog. rate of KNNC using 27 dim = 394/400 = 98.5%
LOO recog. rate of KNNC using 28 dim = 395/400 = 98.75%
LOO recog. rate of KNNC using 29 dim = 394/400 = 98.5%
LOO recog. rate of KNNC using 30 dim = 394/400 = 98.5%
LOO recog. rate of KNNC using 31 dim = 394/400 = 98.5%
LOO recog. rate of KNNC using 32 dim = 394/400 = 98.5%
LOO recog. rate of KNNC using 33 dim = 394/400 = 98.5%
LOO recog. rate of KNNC using 34 dim = 393/400 = 98.25%
LOO recog. rate of KNNC using 35 dim = 394/400 = 98.5%
LOO recog. rate of KNNC using 36 dim = 394/400 = 98.5%
LOO recog. rate of KNNC using 37 dim = 394/400 = 98.5%
LOO recog. rate of KNNC using 38 dim = 394/400 = 98.5%
LOO recog. rate of KNNC using 39 dim = 393/400 = 98.25%
LOO recog. rate of KNNC using 40 dim = 393/400 = 98.25%
LOO recog. rate of KNNC using 41 dim = 393/400 = 98.25%
LOO recog. rate of KNNC using 42 dim = 393/400 = 98.25%
LOO recog. rate of KNNC using 43 dim = 393/400 = 98.25%
LOO recog. rate of KNNC using 44 dim = 393/400 = 98.25%
LOO recog. rate of KNNC using 45 dim = 393/400 = 98.25%
LOO recog. rate of KNNC using 46 dim = 392/400 = 98%
LOO recog. rate of KNNC using 47 dim = 392/400 = 98%
LOO recog. rate of KNNC using 48 dim = 392/400 = 98%
LOO recog. rate of KNNC using 49 dim = 393/400 = 98.25%
LOO recog. rate of KNNC using 50 dim = 393/400 = 98.25%
LOO recog. rate of KNNC using 51 dim = 392/400 = 98%
LOO recog. rate of KNNC using 52 dim = 392/400 = 98%
LOO recog. rate of KNNC using 53 dim = 393/400 = 98.25%
LOO recog. rate of KNNC using 54 dim = 393/400 = 98.25%
LOO recog. rate of KNNC using 55 dim = 393/400 = 98.25%
LOO recog. rate of KNNC using 56 dim = 393/400 = 98.25%
LOO recog. rate of KNNC using 57 dim = 392/400 = 98%
LOO recog. rate of KNNC using 58 dim = 392/400 = 98%
LOO recog. rate of KNNC using 59 dim = 392/400 = 98%
LOO recog. rate of KNNC using 60 dim = 391/400 = 97.75%
LOO recog. rate of KNNC using 61 dim = 392/400 = 98%
LOO recog. rate of KNNC using 62 dim = 392/400 = 98%
LOO recog. rate of KNNC using 63 dim = 392/400 = 98%
LOO recog. rate of KNNC using 64 dim = 392/400 = 98%
LOO recog. rate of KNNC using 65 dim = 392/400 = 98%
LOO recog. rate of KNNC using 66 dim = 392/400 = 98%
LOO recog. rate of KNNC using 67 dim = 392/400 = 98%
LOO recog. rate of KNNC using 68 dim = 392/400 = 98%
LOO recog. rate of KNNC using 69 dim = 392/400 = 98%
LOO recog. rate of KNNC using 70 dim = 392/400 = 98%
LOO recog. rate of KNNC using 71 dim = 392/400 = 98%
LOO recog. rate of KNNC using 72 dim = 392/400 = 98%
LOO recog. rate of KNNC using 73 dim = 392/400 = 98%
LOO recog. rate of KNNC using 74 dim = 392/400 = 98%
LOO recog. rate of KNNC using 75 dim = 392/400 = 98%
LOO recog. rate of KNNC using 76 dim = 392/400 = 98%
LOO recog. rate of KNNC using 77 dim = 392/400 = 98%
LOO recog. rate of KNNC using 78 dim = 392/400 = 98%
LOO recog. rate of KNNC using 79 dim = 392/400 = 98%
LOO recog. rate of KNNC using 80 dim = 393/400 = 98.25%
LOO recog. rate of KNNC using 81 dim = 393/400 = 98.25%
LOO recog. rate of KNNC using 82 dim = 393/400 = 98.25%
LOO recog. rate of KNNC using 83 dim = 393/400 = 98.25%
LOO recog. rate of KNNC using 84 dim = 393/400 = 98.25%
LOO recog. rate of KNNC using 85 dim = 393/400 = 98.25%
LOO recog. rate of KNNC using 86 dim = 393/400 = 98.25%
LOO recog. rate of KNNC using 87 dim = 393/400 = 98.25%
LOO recog. rate of KNNC using 88 dim = 393/400 = 98.25%
LOO recog. rate of KNNC using 89 dim = 393/400 = 98.25%
LOO recog. rate of KNNC using 90 dim = 393/400 = 98.25%
LOO recog. rate of KNNC using 91 dim = 393/400 = 98.25%
LOO recog. rate of KNNC using 92 dim = 393/400 = 98.25%
LOO recog. rate of KNNC using 93 dim = 393/400 = 98.25%
LOO recog. rate of KNNC using 94 dim = 393/400 = 98.25%
LOO recog. rate of KNNC using 95 dim = 393/400 = 98.25%
LOO recog. rate of KNNC using 96 dim = 393/400 = 98.25%
LOO recog. rate of KNNC using 97 dim = 393/400 = 98.25%
LOO recog. rate of KNNC using 98 dim = 393/400 = 98.25%
LOO recog. rate of KNNC using 99 dim = 393/400 = 98.25%
LOO recog. rate of KNNC using 100 dim = 393/400 = 98.25%
Optimum number of eigenvectors = 28, with recog. rate = 98.75%
Elapsed time is 21.484721 seconds.

It is now obvious that the best recognition rate is 98.75%, which occurs when 28 eigenvectors are used, with a corresponding variance coverage of 74.44%. At this accuracy, there are 5 misclassified faces. The following example shows these misclassified faces, together with their 7 nearest neighbors:

Example 10: faceRecog/dispMisclassified01.m

load faceData.mat
load eigenFaceResult.mat % Load A2, eigVec, cumVarPercent, rowDim, colDim
groundTruth=ones(10,1)*(1:40); groundTruth=groundTruth(:)'; % Create the groundtruth
dim=28; % Take the first 28 eigenvectors
A2=A2(1:dim, :);
eigVec2=eigVec(:, 1:dim);
% === Compute the nearest neighbors in the face space
imageNum=size(A2,2);
for i=1:imageNum
    query=A2(:,i);
    distance=distPairwise(query, A2);
    distance(i)=inf;
    [sortDistance, sortIndex]=sort(distance);
    misclassifiedData(i).nearestIndex=sortIndex;
    misclassifiedData(i).distance=sortDistance;
    misclassifiedData(i).miss=0;
    if groundTruth(i)~=groundTruth(sortIndex(1))
        misclassifiedData(i).miss=1;
    end
end
missIndex=find([misclassifiedData.miss]);
missNum=length(missIndex);
n=7;
% === Display the query and the nearest-neighbor faces
for i=1:missNum
    faceIndex=missIndex(i);
    subplot(missNum, n+1, 1+(i-1)*(n+1));
    imagesc(faceData(faceIndex).image); axis image
    title(sprintf('Query %d', i));
    for j=1:n
        subplot(missNum, n+1, 1+j+(i-1)*(n+1));
        imagesc(faceData(misclassifiedData(faceIndex).nearestIndex(j)).image); axis image
        title(sprintf('d=%.2f', misclassifiedData(faceIndex).distance(j)));
    end
end
colormap(gray);
h=findobj(0, 'type', 'axes');
set(h, 'xticklabel', '');
set(h, 'yticklabel', '');

However, it seems the retrieved faces do not resemble the query ones. It should be noted that the distance is computed between the projected faces in the face space (spanned by the 28 eigenvectors corresponding to the top-28 eigenvalues). As a result, it is more reasonable to show the query faces and the retrieved ones as they appear in the face space, as shown next:
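
For reference, the projected face shown in the next example is reconstructed from the top 28 eigenfaces, matching the line projFace=eigVec2*(eigVec2'*(origFace-meanFace))+meanFace in the code:

\[
\hat{x} = m + V_{28}\,V_{28}^{\mathsf T}(x - m),
\]

where V_28 holds the first 28 eigenfaces as columns and m is the mean face.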

Example 11: faceRecog/dispMisclassified02.m

load faceData.mat
load eigenFaceResult.mat % Load A2, eigVec, cumVarPercent, rowDim, colDim
groundTruth=ones(10,1)*(1:40); groundTruth=groundTruth(:)'; % Create the groundtruth
dim=28; % Take the first 28 eigenvectors
A2=A2(1:dim, :);
eigVec2=eigVec(:, 1:dim);
% === Compute the nearest neighbors in the face space
imageNum=size(A2,2);
for i=1:imageNum
    query=A2(:,i);
    distance=distPairwise(query, A2);
    distance(i)=inf;
    [sortDistance, sortIndex]=sort(distance);
    misclassifiedData(i).nearestIndex=sortIndex;
    misclassifiedData(i).distance=sortDistance;
    misclassifiedData(i).miss=0;
    if groundTruth(i)~=groundTruth(sortIndex(1))
        misclassifiedData(i).miss=1;
    end
end
% === Create the projected face
meanFace=mean(double(cat(3, faceData.image)), 3); meanFace=meanFace(:);
for i=1:imageNum
    origFace=double(faceData(i).image); origFace=origFace(:);
    projFace=eigVec2*(eigVec2'*(origFace-meanFace))+meanFace;
    faceData(i).image2=reshape(projFace, rowDim, colDim);
end
missIndex=find([misclassifiedData.miss]);
missNum=length(missIndex);
n=7;
% === Display the query and the nearest-neighbor faces
for i=1:missNum
    faceIndex=missIndex(i);
    subplot(missNum, n+1, 1+(i-1)*(n+1));
    imagesc(faceData(faceIndex).image2); axis image
    title(sprintf('Query %d', i));
    for j=1:n
        subplot(missNum, n+1, 1+j+(i-1)*(n+1));
        imagesc(faceData(misclassifiedData(faceIndex).nearestIndex(j)).image2); axis image
        title(sprintf('d=%.2f', misclassifiedData(faceIndex).distance(j)));
    end
end
colormap(gray);
h=findobj(0, 'type', 'axes');
set(h, 'xticklabel', '');
set(h, 'yticklabel', '');

However, the best recognition rate obtained above is overly optimistic, since all faces (including the test face) were used to compute the PCA projection during the LOO test. A more objective way to estimate the recognition rate is to exclude the test data from the PCA computation, as shown next. (Be warned that this example takes much longer to run.)
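
The idea can be sketched as follows. This is a minimal, self-contained sketch in plain MATLAB of what "excluding the test face from the PCA" means; the actual example below relies on the toolbox functions faceData2ds and faceRecogPerfLoo, whose internals may differ in detail.

% Minimal sketch of LOO evaluation in which the PCA basis is recomputed from the
% training faces only (the test face is excluded). Illustration only; the
% toolbox's faceRecogPerfLoo may differ in its details.
load faceData.mat
nImage=length(faceData);
X=zeros(numel(faceData(1).image), nImage);
for i=1:nImage, X(:,i)=double(faceData(i).image(:)); end
label=ones(10,1)*(1:40); label=label(:)';          % ground truth, assuming 10 images per person (as in Example 10)
pcaDim=28; correct=0;
for i=1:nImage
    trainIdx=setdiff(1:nImage, i);
    m=mean(X(:,trainIdx), 2);
    A=X(:,trainIdx)-repmat(m, 1, nImage-1);        % mean-subtracted training faces
    % Eigenface trick: eigenvectors of the small matrix A'*A give those of A*A' after mapping by A
    [V, D]=eig(A'*A);
    [~, order]=sort(diag(D), 'descend');
    U=A*V(:, order(1:pcaDim));                     % top-pcaDim eigenfaces
    U=U./repmat(sqrt(sum(U.^2, 1)), size(U,1), 1); % normalize each eigenface to unit length
    trainFeat=U'*A;                                % projected training faces
    testFeat=U'*(X(:,i)-m);                        % projected test face
    dist=sum((trainFeat-repmat(testFeat, 1, nImage-1)).^2, 1);
    [~, nearest]=min(dist);                        % 1-nearest neighbor in the face space
    correct=correct+(label(trainIdx(nearest))==label(i));
end
fprintf('LOO recog. rate with pcaDim=%d = %.2f%%\n', pcaDim, 100*correct/nImage);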

Example 12: faceRecog/optPcaEigNum02.m

load faceData.mat
maxDim=30; % Max dim. after PCA
frMethod='pca';
% ====== Create DS
fprintf('Creating DS... ===> '); tic
DS=faceData2ds(faceData);
fprintf('%.2f sec\n', toc);
myTic=tic;
looRecogRate=zeros(1, maxDim);
time=zeros(1, maxDim);
for i=1:maxDim
    opt=faceRecogPerfLoo('defaultOpt');
    opt.pcaDim=i;
    opt.method=frMethod;
    fprintf('%d/%d: opt.pcaDim=%d\n', opt.pcaDim, maxDim, i);
    [looRecogRate(i), computedClass, correct, timeVec]=faceRecogPerfLoo(DS, opt);
    time(i)=sum(timeVec);
    fprintf('\trr=%.2f%%\n', looRecogRate(i)*100);
end
toc(myTic)
plot(1:maxDim, looRecogRate*100, '.-');
[maxRr, index]=max(looRecogRate);
line(index, maxRr*100, 'color', 'r', 'marker', 'o');
fprintf('Max RR=%.2f%% at dim=%d\n', maxRr*100, index);
xlabel('PCA feature dimension');
ylabel('LOO recog. rate');
grid on

Output:
Creating DS... ===> 0.02 sec
1/30: opt.pcaDim=1    rr=10.75%
2/30: opt.pcaDim=2    rr=39.50%
3/30: opt.pcaDim=3    rr=74.00%
4/30: opt.pcaDim=4    rr=81.25%
5/30: opt.pcaDim=5    rr=86.00%
6/30: opt.pcaDim=6    rr=89.75%
7/30: opt.pcaDim=7    rr=94.25%
8/30: opt.pcaDim=8    rr=95.25%
9/30: opt.pcaDim=9    rr=96.50%
10/30: opt.pcaDim=10    rr=96.25%
11/30: opt.pcaDim=11    rr=97.00%
12/30: opt.pcaDim=12    rr=97.75%
13/30: opt.pcaDim=13    rr=97.50%
14/30: opt.pcaDim=14    rr=97.75%
15/30: opt.pcaDim=15    rr=97.75%
16/30: opt.pcaDim=16    rr=97.50%
17/30: opt.pcaDim=17    rr=97.25%
18/30: opt.pcaDim=18    rr=97.50%
19/30: opt.pcaDim=19    rr=97.75%
20/30: opt.pcaDim=20    rr=97.25%
21/30: opt.pcaDim=21    rr=97.50%
22/30: opt.pcaDim=22    rr=97.75%
23/30: opt.pcaDim=23    rr=97.75%
24/30: opt.pcaDim=24    rr=97.75%
25/30: opt.pcaDim=25    rr=97.50%
26/30: opt.pcaDim=26    rr=98.25%
27/30: opt.pcaDim=27    rr=98.00%
28/30: opt.pcaDim=28    rr=98.50%
29/30: opt.pcaDim=29    rr=98.25%
30/30: opt.pcaDim=30    rr=98.25%
Elapsed time is 1553.538966 seconds.
Max RR=98.50% at dim=28

From the above example, the more objective recognition rate is 98.50%, which occurs when the dimension for PCA projection is 28.

You may wonder what an image looks like after being projected onto the face space spanned by the optimal 28 eigenfaces. Here is an example showing the original image, the projected image, and their difference.

Example 13: faceRecog/facePcaProjDiff01.m

load faceData.mat
load eigenFaceResult.mat % Load A2, eigVec, cumVarPercent, rowDim, colDim
eigVec2=eigVec(:, 1:28); % Take the first 28 eigenvectors
origFace=double(faceData(31).image); origFace=origFace(:);
meanFace=mean(double(cat(3, faceData.image)), 3); meanFace=meanFace(:);
projFace=eigVec2*(eigVec2'*(origFace-meanFace))+meanFace;
subplot(1,3,1); imagesc(reshape(origFace, rowDim, colDim)); axis image; colormap(gray); title('Original image');
subplot(1,3,2); imagesc(reshape(projFace, rowDim, colDim)); axis image; colormap(gray); title('Projected image');
subplot(1,3,3); imagesc(reshape(origFace-projFace, rowDim, colDim)); axis image; colormap(gray); title('Difference');
fprintf('Difference between orig. and projected images = %g\n', norm(origFace-projFace));

Output:
Difference between orig. and projected images = 1986.89

On the other hand, if the original image is not a (human) face at all, the difference between the original and projected images will be larger:

Example 14: faceRecog/facePcaProjDiff02.m

load faceData.mat
load eigenFaceResult.mat % Load A2, eigVec, cumVarPercent, rowDim, colDim
eigVec2=eigVec(:, 1:28); % Take the first 28 eigenvectors
origFace=double(imresize(imread('catPangPang.png'), [rowDim, colDim])); origFace=origFace(:);
meanFace=mean(double(cat(3, faceData.image)), 3); meanFace=meanFace(:);
projFace=eigVec2*(eigVec2'*(origFace-meanFace))+meanFace;
subplot(1,3,1); imagesc(reshape(origFace, rowDim, colDim)); axis image; colormap(gray); title('Original image');
subplot(1,3,2); imagesc(reshape(projFace, rowDim, colDim)); axis image; colormap(gray); title('Projected image');
subplot(1,3,3); imagesc(reshape(origFace-projFace, rowDim, colDim)); axis image; colormap(gray); title('Difference');
fprintf('Difference between orig. and projected images = %g\n', norm(origFace-projFace));

Output:
Difference between orig. and projected images = 3008.69

Here the distance between the original image and the projected one is much larger than the one obtained in the previous example. As a result, it is possible to use DFFS (distance from face space) to determine whether a given image is a human face or not.
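
In formula form, DFFS is the norm of the reconstruction residual:

\[
\mathrm{DFFS}(x) = \bigl\lVert (x - m) - V_{28}V_{28}^{\mathsf T}(x - m) \bigr\rVert .
\]

For the 400 faces in the dataset, whose mean-subtracted versions lie in the span of the eigenfaces, this equals the norm of the projection coefficients beyond the first 28, which is what norm(A2(29:end,i)) computes in the next example.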

We can plot the histogram of the 400 DFFS values, together with the faces that have the minimum and maximum DFFS:

Example 15: faceRecog/pcaDffsHist01.m

load faceData.mat
load eigenFaceResult.mat % Load A2, eigVec, cumVarPercent, rowDim, colDim
meanFace=mean(double(cat(3, faceData.image)), 3); meanFace=meanFace(:);
eigVec2=eigVec(:, 1:28); % Take the first 28 eigenvectors
for i=1:length(faceData)
    temp=A2(29:end,i);
    dffs(i)=norm(temp);
end
subplot(3,1,1); hist(dffs, 30);
[minValue, minIndex]=min(dffs);
origFace=double(faceData(minIndex).image); origFace=origFace(:);
projFace=eigVec2*(eigVec2'*(origFace-meanFace))+meanFace;
subplot(3,3,4); imagesc(reshape(origFace, rowDim, colDim)); axis image; colormap(gray); title('Original image');
subplot(3,3,5); imagesc(reshape(projFace, rowDim, colDim)); axis image; colormap(gray); title('Projected image');
subplot(3,3,6); imagesc(reshape(origFace-projFace, rowDim, colDim)); axis image; colormap(gray); title('Difference');
fprintf('Min DFFS = %g\n', norm(origFace-projFace));
[maxValue, maxIndex]=max(dffs);
origFace=double(faceData(maxIndex).image); origFace=origFace(:);
projFace=eigVec2*(eigVec2'*(origFace-meanFace))+meanFace;
subplot(3,3,7); imagesc(reshape(origFace, rowDim, colDim)); axis image; colormap(gray); title('Original image');
subplot(3,3,8); imagesc(reshape(projFace, rowDim, colDim)); axis image; colormap(gray); title('Projected image');
subplot(3,3,9); imagesc(reshape(origFace-projFace, rowDim, colDim)); axis image; colormap(gray); title('Difference');
fprintf('Max DFFS = %g\n', norm(origFace-projFace));

Output:
Min DFFS = 1353.94
Max DFFS = 2885.55

For a given face, we can also find similar faces according to their distances within the face space:

Example 16: faceRecog/facePcaSimilarity01.m

load faceData.mat
load eigenFaceResult.mat % Load A2, eigVec, cumVarPercent, rowDim, colDim
A2=A2(1:28, :);
target=A2(:,1);
allDistance=distPairwise(target, A2);
personNum=length(faceData)/10;
for i=1:personNum
    mask=inf*ones(length(allDistance), 1);
    index=(i-1)*10+1:i*10;
    mask(index)=allDistance(index);
    [distance(i), nearest(i)]=min(mask);
end
[minDistance, minIndex]=sort(distance);
nearest=nearest(minIndex);
nearest=reshape(nearest, personNum/4, 4)';
clear temp
for i=1:4
    temp(i).imageVec=[faceData(nearest(i,:)).image];
end
imageMat=cat(1, temp.imageVec);
subplot(2,1,1);
imagesc(imageMat); axis image; colormap(gray);
set(gca, 'xtick', []); set(gca, 'ytick', []);
subplot(2,1,2);
plot(minDistance, 'o-');
title('Distance (after projection) to the first image');
ylabel('Distance');
xlabel('Index of the sorted distance');

References:
  1. M.A. Turk and A.P. Pentland, "Face Recognition Using Eigenfaces", IEEE Conf. on Computer Vision and Pattern Recognition, pp. 586-591, 1991.
